Generalizing Manipulations using Vision Kernels
نویسندگان
چکیده
In order to perform complex manipulation tasks, a robot must know which actions it can perform with the available objects. In unstructured environments, potential manipulations afforded by objects will not be pre-specified, and must instead be learned. Rather than determining each novel object’s affordances from scratch, the robot can learn more efficiently by generalizing manipulations from similar known objects. Actions can be generalized to new objects by learning direct mappings from the object’s visual features to actions [1]. This approach differentiates itself from indirect methods by not requiring intermediate representations, such as object classes [2]. A robot can autonomously learn the afforded actions of an object by applying the action to the object and observing the resulting effects [3], [4]. If the desired effect is achieved, then the object can be labeled as affording this action. Thus, this affordance learning task can be treated as a binary classification problem for a given action. Our approach is based on two key insights: 1) The perception of objects and the interactions between objects are based largely on the objects’ surface geometries [1], and 2) the affordances of objects are often related to only subparts of objects and not the whole object [5]. Therefore, we propose generalizing actions to new objects by finding subparts of objects that have similar shapes and are, therefore, more likely to have the same affordances. The subparts of objects are represented in a nonparametric manner, which is based directly on the observed point clouds of the subparts. Thus, the robot does not rely on task-specific visual features, and can discriminate between any subparts that are not visually identical. Using this nonparametric representation, we also define a kernel function for computing the similarity between different subparts. Hence, we can use kernel learning methods [6], such as kernel logistic regression, in order to learn which subparts afford a given action. The proposed method was successfully tested on a real robot, as shown in Fig. 1. Starting with a single human demonstration of the task, the robot was able to learn to generalize this action to novel objects of different shapes and sizes.
منابع مشابه
Generic polar harmonic transforms for invariant image representation
This paper introduces four classes of rotation-invariant orthogonal moments by generalizing four existing moments that use harmonic functions in their radial kernels. Members of these classes share beneficial properties for image representation and pattern recognition like orthogonality and rotation-invariance. The kernel sets of these generic harmonic function-based moments are complete in the...
متن کاملKernels for Semi-Structured Data
Semi-structured data such as XML and HTML is attracting considerable attention. It is important to develop various kinds of data mining techniques that can handle semistructured data. In this paper, we discuss applications of kernel methods for semistructured data. We model semi-structured data by labeled ordered trees, and present kernels for classifying labeled ordered trees based on their ta...
متن کاملOn Power-Law Kernels, Corresponding Reproducing Kernel Hilbert Space and Applications
The role of kernels is central to machine learning. Motivated by the importance of power-law distributions in statistical modeling, in this paper, we propose the notion of powerlaw kernels to investigate power-laws in learning problem. We propose two power-law kernels by generalizing Gaussian and Laplacian kernels. This generalization is based on distributions, arising out of maximization of a ...
متن کاملFinding Optimal Combination of Kernels using Genetic Programming
In Computer Vision, problem of identifying or classifying the objects present in an image is called Object Categorization. It is a challenging problem, especially when the images have clutter background, occlusions or different lighting conditions. Many vision features have been proposed which aid object categorization even in such adverse conditions. Past research has shown that, employing mul...
متن کاملNonlinear characterization of a simple process in human vision.
Perceptual processes are often modeled as linear filters followed by a decisional rule. This simple model is central to the understanding of visual processing in humans. Its scope may be extended to capture a wider range of behaviors by the addition of nonlinear operators or kernels, but there is no evidence in human sensory processing that these operators are able to enhance the linear descrip...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012